Assessing the Generalizability of Deep Learning Models Trained on Standardized and Nonstandardized Images and Their Performance Against Teledermatologists: Retrospective Comparative Study
نویسندگان
چکیده
Background Convolutional neural networks (CNNs) are a type of artificial intelligence that shows promise as diagnostic aid for skin cancer. However, the majority trained using retrospective image data sets with varying capture standardization. Objective The aim our study was to use CNN models same architecture—trained on acquired either device and technique (standardized) or varied devices techniques (nonstandardized)—and test variability in performance when classifying cancer images different populations. Methods In all, 3 CNNs architecture were trained. nonstandardized (CNN-NS) 25,331 taken from International Skin Imaging Collaboration (ISIC) devices. standardized (CNN-S) 177,475 MoleMap device, number 2 (CNN-S2) subset (matched classes training CNN-NS). These then tested external sets: 569 Danish images, publicly available ISIC 2020 set consisting 33,126 University Queensland (UQ) 422 images. Primary outcome measures sensitivity, specificity, area under receiver operating characteristic curve (AUROC). Teledermatology assessments used determine model compared teledermatologists. Results When CNN-S achieved an AUROC 0.861 (95% CI 0.830-0.889) CNN-S2 0.831 0.798-0.861; models), both outperforming CNN-NS (nonstandardized model; P=.001 P=.009, respectively), which 0.759 0.722-0.794). additional (ISIC UQ), (P<.001 P<.001, respectively) (P=.08 P=.35, still outperformed CNN-NS. matched mean sensitivity specificity teledermatologists set, models’ resultant sensitivities specificities surpassed by CNN-S, differences not statistically significant (sensitivity: P=.10; specificity: P=.053). Performance across all well influenced quality. Conclusions had improved and, therefore, greater generalizability classification applied unseen sets. This finding is important consideration future algorithm development, regulation, approval.
منابع مشابه
the relationship between learners critical thinking ability and their performance in the reading sections of the tofel and ielts test
the study reflected in this thesis aims at finding out relationships between critical thinking (ct), and the reading sections of tofel and ielts tests. the study tries to find any relationships between the ct ability of students and their performance on reading tests of tofel and academic ielts. however, no research has ever been conducted to investigate the relationship between ct and the read...
15 صفحه اولthe effect of explicit teaching of metacognitive vocabulary learning strategies on recall and retention of idioms
چکیده ندارد.
15 صفحه اولthe effects of planning on accuracy and complexity of iranian efl students’ written narrative task performance
this study compared the different effects of form-focused guided planning vs. meaning-focused guided planning on iranian pre-intermediate students’ task performance. the study lasted for three weeks and concentrated on eight english structures. forty five pre-intermediate iranian students were randomly assigned to three groups of guided planning focus-on-form group (gpfg), guided planning focus...
15 صفحه اولthe effects of time planning and task complexity on accuracy of narrative task performance
هدف اصلی این تحقیق بررسی تاثیر برنامه ریزی زمانی، هم چنین افزایش میزان پیچیدگی تکالیف در نظر گرفته شده بصورت همزمان، بر دقت و صحت و پیچیدگی عملکرد نوشتاری زبان آموزان می باشد. بدین منظور، 50 نفر از دانش آموزان دختر در رده ی سنی 16 الی 18 سال به عنوان شرکت کنندگان در این زمینه ی تحقیق در نظر گرفته شدند و به دو گروه آزمایشی و کنترل بصورت اتفاقی تقسیم شدند. اعضای گروه آزمایشی هر دو تکلیف ساده و پی...
investigating the effect of motivation and attitude towards learning english, learning style preferences and gender on iranian efl learners proficiency
تحقیق حاضر به منظور بررسی تاثیر انگیزه و نگرش نسبت به یادگیری زبان انگلیسی، ترجیحات سبک یادگیری و جنسیت بر بسندگی فراگیران ایرانی زبان انگلیسی انجام شد. برای این منظور، 154 فراگیر ایرانی زبان انگلیسی در این تحقیق شرکت کردند. سه ابزار جمع آوری داده ها شامل آزمون تعیین سطح بسندگی زبان انگلیسی آکسفورد، پرسشنامه ترجیحات سبک یادگیری براچ و پرسشنامه انگیزه و نگرش نسبت به یادگیری زبان انگلیسی به م...
ذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: JMIR dermatology
سال: 2022
ISSN: ['2562-0959']
DOI: https://doi.org/10.2196/35150